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1 A parallel g enetic algorithm for the graph partitioning problem 
E.-G. Talbi, P. Bessiere 

June 1991 Proceedings of the 5th international conference on Supercomputing 

Full text available: ^ pdf(801.46 KB) Additional Information: full citation , references , index terms 
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Pen computing: a technology overview and a vision Q 
Andre Meyer 

July 1995 ACM SIGCHI Bulletin, Volume 27 issue 3 

Full text available: ^pdf(5.14 MB) Additional Information: full citation , abstract , citings, index terms 

This work gives an overview of a new technology that is attracting growing interest in public 
as well as in the computer industry itself. The visible difference from other technologies is in 
the use of a pen or pencil as the primary means of interaction between a user and a 
machine, picking up the familiar pen and paper interface metaphor. From this follows a set 
of consequences that will be analyzed and put into context with other emerging 
technologies and visions. Starting with a short historic ... 



3 Simulation via implementation with applications in computer communication 
Kenneth Brayer, Valerie Lafleur, Gary Simpson 

March 1982 Proceedings of the 15th annual symposium on Simulation 

Full text available: g pdf(1.39 MB) Additional Information: full citation , abstract , index terms 

The traditional approach to performing discrete digital simulation has been one of 
developing a mathematical or statistical model to represent a process, programming this 
model on a large scale computer, and then executing the model to obtain performance 
results. In this study, the authors have developed a simulation of a computer 
communication network by simulating the users in a central computer and implementing 
the remainder of the network in actual network processors. This allows for au ... 

4 A Probabilistic-Based Design Methodology for Nanoscale Computation Q 
R. Iris Bahar, Joseph Mundy, Jie Chen 

November 2003 Pr ceedings f the 2003 IEEE/ACM internati nal c nference n 
C mputer-aided design 

Full text available: ^ pdf(354.52 KB) Additional Information: full citation , abstract , index terms 
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As current silicon-based techniques fast approach their practicallimits, the investigation of 
nanoscale electronics, devices andsystem architectures becomes a central research priority. 
It is expectedthat nanoarchitectures will confront devices and interconnectionswith high 
inherent defect rates, which motivates the searchfor new architectural paradigms. In this 
paper, we propose a probabilistic-based design methodologyfor designing nanoscale 
computer architectures based onMarkov Random Fields (M ... 

5 98^/Mflops/s ultra-large-scale neural-network training on a pill cluster 
Douglas A. Aberdeen, Jonathan Baxter, Robert Edwards 

November 2000 Proceedings of the 2000 ACM/IEEE conference on Supercomputing 



Artificial neural networks with millions of adjustable parameters and a similar number of 
training examples are a potential solution for difficult, large-scale pattern recognition 
problems in areas such as speech and face recognition, classification of large volumes of 
web data and finance. The bottleneck is that neural network training involves iterative 
gradient descent and is extremely computationally intensive. In this paper we present a 
technique for distributed training of Ultra Large ... 

Keywords: neural-network, Linux cluster, matrix-multiply 



6 Neural networks and dynamic complex systems |_j 
Geoffrey Fox, Wojtek Furmanski, Alex Ho, Jeff Koller, Peter Simic, Isaac Wong 

March 1989 Proceedings of the 22nd annual symposium on Simulation 

Full text available: Q pdfM.44 MB) Additional Information: full citation , abstract , references , index terms 

We describe the use of neural networks for optimization and inference associated with a 
variety of complex systems. We show how a string formalism can be used for parallel 
computer decomposition, message routing and sequential optimizing compilers. We extend 
these ideas to a general treatment of spatial assessment and distributed artificial 
intelligence. 

7 An architecture for extended abstract data flow Q 
Vason P. Srini 

May 1981 Proceedings of the 8th annual symposium on Computer Architecture 



A distributed computer system environment for executing extended abstract data flow 
graphs (EDFGs) is presented. Sequencing functions in EDFGs depends on the availability of 
data. Since the functions are free of side effects, unrelated functions can be executed in 
parallel if the required data is available. The environment comprises an organization of 
conventional or data flow processors and the kernel functions of a distributed operating 
system. The processors are organized in g ... 

8 Architectures: A perspective on the future of massively parallel computing: fine-grain I I 
vs. coarse-grain parallel models comparison & contrast 
Predrag T. Tosic 

April 2004 Pr ceedings f the first c nference nc mputing fr ntiers nC mputing 
fr ntiers 

Full text available: fiQ pdf(277,49 KB) Additional Information: full citation , abstract , references , index terms 



Models, architectures and languages for parallel computation have been of utmost research 





Additional Information: full citation , abstract , references , citings , index 
terms 
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interest in computer science and engineering for several decades. A great variety of parallel 
computation models has been proposed and studied, and different parallel and distributed 
architectures designed as some possible ways of harnessing parallelism and improving 
performance of the general purpose computers.Afass/Ve// parallel connectionist models such 
as artificial neural networks ( ... 

Keywords: cellular automata, distributed systems, massively parallel computing, 
multiprocessor computers, neural networks, parallel computation models 



9 What have we learnt from using real parallel machines to solve real problems? Q 
G. c. Fox 

January 1989 Proceedings of the third conference on Hypercube concurrent computers 

and applications - Volume 2 

r- ,. * , ui 0i m,a no md\ Additional Information: full citation , abstract , references , citings , index 
Full text available: 7T] pdf(4.Q8 MB) ; 

terms 

We briefly review some key scientific and parallel processing issues in a selection of some 
84 existing applications of parallel machines. We include the MIMD hypercube transputer 
array, BBN Butterfly, and the SIMD ICL DAP, Goodyear MPP and Connection Machine from 
Thinking Machines. We use a space-time analogy to classify problems and show how a 
division into synchronous, loosely synchronous and asynchronous problems is helpful. This 
classifies problems into those suitable for SIMD or MIMD ... 

10 Load balancing loosely synchronous problems with a neural network Q 
G. C. Fox, W. Furmanski 

January 1988 Proceedings of the third conference on Hypercube concurrent computers 
and applications: Architecture, software, computer systems, and general 
issues - Volume 1 

r- ii * * i ut 01 ^*/o on kao\ Additional Information: full citation , abstract , references , citings , index 
Full text available: TO pdf(2.90 MB) - 

terms 

Hopfield and Tank have introduced the use of neural networks for the solution of 
optimization problems such as the traveling salesman problem. Here we show how to 
generalize this method to decompose loosely synchronous problems onto parallel machines 
and in particular the hypercube. In this case, decomposition or load balancing can be 
formulated graph theoretically in terms of optimal partitioning of the computational graph 
into N = 2 

11 Identification of parallelism in neural networks by simulation with language J. Q 
Alexei N. Skurihin, Alvin 3. Surkan 

September 1993 ACM SIGAPL APL Quote Quad , Proceedings of the international 
conference on APL, volume 24 issue i 

Full text available- Hi pdf(588 94 KB) Addltional Information: full citation, abstract , references , citings , index 
' la-^- J ! terms 

Neural networks, trained by backpropagation, are designed and described in the language 
3, an APL derivative with powerful function encapsulation features. Both the languages J 
[4,6,7] and APL [5] help to identify and isolate the parallelism that is inherent in network 
training algorithms. Non-critical details of data input and derived output processes are de- 
emphasized by relegating those functions to callable stand-alone modules. Such input and 
output ... 

12 A constructive algorithm for neural networks that generalize Q 
Alvin Surkan, Colin Campbell 

January 1998 ACM SIGAPL APL Qu te Quad , Pr ceedings f the c nference n Share 

kn wledge share success, volume 28 issue 4 
Full text available: ^ pdf(650.Q4 KB) Additional Information: full citation , abstract , index terms 
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APL functions were designed to describe a constructive algorithm that synthesizes a neural 
network while optimizing its ability to generalize. Algorithms are implemented in programs 
to discover networks of binary weights that assign unfamiliar, high-dimension binary 
patterns to their most similar classes. Constructive algorithms that create networks are 
important for the design of classifiers based on array- processors made from fast two-level 
circuits. APL is an effective tool for the exposition ... 

13 Ariel: a scalable multiprocessor for the simulation of neural networks 
Gary Frazier 

March 1990 ACM SIGARCH Computer Architecture News, volume 18 issue l 
Full text available: ^ pdf(729.87 KB) Additional Information: full citation , abstract , index terms 

Ariel is a multiprocessor architecture that we are developing to simulate neural networks 
and other models of distributed computation. The design is based upon a hierarchical 
network of coarse-grained processing modules. The module hardware uses fast digital 
signal processors and very large semiconductor memories to provide the throughput and 
storage capacity required to simulate large networks. Our objective is to provide a system 
that can be scaled up to simulate neural networks compose ... 

14 Critical issues in mapping neural networks on message-passing multicomputers I I 
J. Ghosh, K. Hwang 

May 1988 ACM SIGARCH Computer Architecture News , Proceedings of the 15th 

Annual International Symposium on Computer architecture, volume 16 issue 2 
Full text available: ^ pdf(1.05 MB) Additional Information: full citation , abstract , references , index terms 

Connectionist models such as artificial neural systems, offer an intrinsically concurrent 
computational paradigm. We investigate the architectural requirements for efficiently 
simulating large neural networks on a multicomputer system with thousands of fine-grained 
processors and distributed memory. First, models for characterizing the structure of a 
neural network and the function of individual cells are developed. These models provide 
guidelines for efficiently mapping the network onto mu ... 

1 5 High speed neural network chip for trigger purposes in high energy physics I I 
W. Eppler, T. Fischer, H. Gemmeke, A. Menchikov 

February 1998 Proceedings of the conference on Design, automation and test in Europe 

Full text available: fB pdfd 16.42 KB) 

Jst Additional Information: full citation , abstract , references , index terms 

Publisher Site 

A novel neural chip SAND (Simple Applicable Neural Device) is described. It is highly usable 
for hardware triggers in particle physics. The chip is optimized for a high input data rate (50 
MHz, 16 bit data) at a very low cost basis. The performance of a single SAND chip is 200 
MOPS due to four parallel 16 bit multipliers and 40 bit adders working in one clock cycle. 
The chip is able to implement feedforward neural networks with a maximum of 512 input 
neurons and three hidden layers. Kohonen feat ... 

Keywords: VME board with neural network chip SAND, Hardware accelerator for neural 
networks, High energy physics : trigger, on- and off-line analysis 




16 Transparent remote execution in LAHNOS by means of a neural network device I I 
M. Cena, M. L. Crespo, R. Gallard 

January 1995 ACM SIGOPS Operating Systems Review, volume 29 issue l 

Full text available: ^ pdf(596.41 KB) Additional Information: full citation , abstract , index terms 

LAHNOS is a Local Area Heterogeneous Operating System [1] being currently developed at 
the Universidad Nacional de San Luis over which distributed services are to be built. This 
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paper shows some enhancements to be introduced into the original design in order to 
achieve automatic allocation of remote execution requests to the best fitted node under 
some chosen performance criteria. Following current trends in Distributed Systems we 
propose the insertion of a neural network device into the kernel o ... 

Keyw rds: neural networks, operating system, workload 



17 Mark lllfp hvpercube concurrent processor architecture Q 
J. Tuazon, J. Peterson, M. Pniel 

January 1988 Proceedings of the third conference on Hypercube concurrent computers 
and applications: Architecture, software, computer systems, and general 
issues - Volume 1 

Full text available* ISil Ddfd 02 MB) Additional Information: full citation , abstract , references , citings , index 
' *■ terms 

The Mark lllfp Hypercube is a new generation of hypercube concurrent processor system 
developed at JPL/Caltech, with peak performance of 5 Mips, 14 Mflops per node, and a peak 
communication rate of 6 Mbytes per second. Each node utilizes two Motorola MC68020 
microprocessors, an MC68882 scalar floating- point coprocessor, and a Weitek 8000 
floating-point chip set. One of the MC68020 processors serves as the application and 
computational processor, the other is dedicated to communication. The ... 

18 A survey of commercial parallel processors I I 
Edward Gehringer, Janne Abullarade, Michael H. Gulyn 

September 1988 ACM SIGARCH Computer Architecture News, volume 16 issue 4 

Full text available: ^ pdf(2.96 MB) Additional Information: full citation , abstract , citings , index terms 

This paper compares eight commercial parallel processors along several dimensions. The 
processors include four shared-bus multiprocessors (the Encore Multimax, the Sequent 
Balance system, the Alliant FX series, and the ELXSI System 6400) and four network 
multiprocessors (the BBN Butterfly, the NCUBE, the Intel iPSC/2, and the FPS T Series). The 
paper contrasts the computers from the standpoint of interconnection structures, memory 
configurations, and interprocessor communication. Also, the share ... 

19 X-Tree: A tree structured multi-processor computer architecture I I 
Alvin M. Despain, David A. Patterson 

April 1978 Proceedings of the 5th annual symposium on Computer architecture 

Full text available* fSl Ddf(535 88 KB) Additiona ' Information: full citation , abstract , references , citings , index 

terms 

The problem of organizing multiple, monolithic microprocessors into an effective general 
purpose computer structure is examined. A tree structure with extra interconnections was 
found to be especially attractive. It provides a structured hierarchy for control, addressing 
and message routing. More important, it appears to provide a mechanism to automatically 
migrate data abstractions and processes over the network of processors. The network can 
be expanded to any desired size and no global c ... 

20 Multiprocessor simulation of neural networks with NERV I I 
R. Manner, R. Horner, R. Hauser, A. Genthner 

August 1989 Pr ceedings f the 1989 ACM/IEEE c nference n Superc mputing 

Full text available: ^pdf(1.08 MB) Additional Information: full citation , abstract , references , index terms 

A general-purpose simulation system for neural networks is computationally very 
demanding. This paper presents some estimations of the computing power required, the 
necessary interconnection bandwidth, and the requisite memory size. Next, the hardware 
architecture of the NERV multiprocessor system is derived that fulfills these requirements. 
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Up to 320 processors 68020 are used in a single VME crate together with a Macintosh II as 
a host computer. This set-up provides a computing power of 13 ... 
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21 On a learnability question associated to neural networks with continuous activations I I 
(extended abstract) 

Bhaskar DasGupta, Hava T. Siegelmann, Eduardo Sontag 

July 1994 Proceedings of the seventh annual conference on Computational learning 
theory 

Full text available: ^S |pdf(1.01 MB) Additional Information: full citation , abstract , references , index terms 

This paper deals with learnability of concept classes defined by neural networks, showing 
the hardness of PAC-learning (in the complexity, not merely information-theoretic sense) 
for networks with a particular class of activation. The obstruction lies not with the VC 
dimension, which is known to grow slowly; instead, the result follows the fact that the 
loading problem is NP-complete. (The complexity scales badly with input dimension; the 
loading problem is polynomial-time ... 



22 Integrating neural networks with special purpose simulation 
Dany Hajjar, Simaan AbouRizk, Kevin Mather 

December 1998 Proceedings of the 30th conference on Winter simulation 

Full text available: f5| pdf(128,17 KB) Additional Information: full citation , references , index terms 
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23 Association-based natural language processing with neural networks 
Kimura Kazuhiro, Suzuoka Takashi, Amano Sin-ya 

June 1992 Proceedings of the 30th conference on Association for Computational 
Linguistics 

Full text available: pdf(450.05 KB) 

jfiT" Additional Information: full citation , abstract , references 

Ifl P Publisher Site 

This paper describes a natural language processing system reinforced by the use of 
association of words and concepts, implemented as a neural network. Combining an 
associative network with a conventional system contributes to semantic disambiguation in 
the process of interpretation. The model is employed within a kana-kanji conversion system 
and the advantages over conventional ones are shown. 

24 Principles of runtime support for parallel processors I I 
R. Mirchandaney, J. H. Saltz, R. M. Smith, D. M. Nico, K. Crowley 
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June 1988 Pr ceedings f the 2nd internati nal c nference n Superc mputing 

CIU . i u. « „a«4 oo mq\ Additional Information: full citation , abstract , references , citings , index 

Full text available: TH pdf(1.32 MB) - 

^ terms 

There exists substantial data level parallelism in scientific problems. The PARTY runtime 
system is an attempt to obtain efficient parallel implementations for scientific computations, 
particularly those where the data dependencies are manifest only at runtime. This can 
preclude compiler based detection of certain types of parallelism. The automated system is 
structured as follows: An appropriate level of granularity is first selected for the 
computations. A directed acyclic graph represent .:. 

25 Session 9A: System level test and reliability: Accurate CMOS bridge fault modeling I I 
with neural network-based VHDL saboteurs 
Don Shaw, Dhamin Al-Khalili, Come Rozon 

November 2001 Proceedings of the 2001 IEEE/ACM international conference on 

Computer-aided design 

.. . . a 7n ^ D v Additional Information: full citation , abstract , references , citings , index 

Full text available: Tq pdf(137.79 KB) 

^ terms 

This paper presents a new bridge fault model that is based on a multiple layer feedforward 
neural network and implemented within the framework of a VHDL saboteur cell. Empirical 
evidence and experimental results show that it satisfies a prescribed set of bridge fault 
model criteria better than existing approaches. The new model computes exact bridged 
node voltages and propagation delay times with due attention to surrounding circuit 
elements. This is significant since, with the exception of full ... 

Keywords: CMOS ICs, VHDL, bridge defects, fault models, fault simulation, neural 
networks 



26 Problems and approaches for a Teraflop processor 
A. H. Frey, G. C. Fox 

January 1988 Proceedings of the third conference on Hypercube concurrent computers 
and applications: Architecture, software, computer systems, and general 
issues - Volume 1 

Full text available: ^ pdf(463.59 KB) Additional Information: full citation , abstract , references , index terms 

This paper discusses problems associated with designing a processor capable of sustaining 
a teraflop (1012 floating point operations per second) of processing power. Several 
researcher have speculated on achieving this performance. The technical problems of a 
practical design are shown to be formidable. However, none of these problems requires a 
technology breakthrough for their solution. The predictable advances of the next generation 
of technology together with a majo ... 

27 Embedded systems: Accurate software performance estimation using domain 
classification and neural networks 

Marcio Seiji Oyamada, Felipe Zschornack, Flavio Rech Wagner 

September 2004 Proceedings of the 17th symposium on Integrated circuits and system 
design 

Full text available: ^pdfd 37.87 KB) Additional Information: full citation , abstract , references , index terms 

For the design of an embedded system, there is a variety of available processors, each one 
offering a different trade-off concerning factors such as performance and power 
consumption. High-level performance estimation of the embedded software implemented in 
a particular architecture is essential for a fast design space exploration, including the choice 
of the most appropriate processor. However, advanced architectures present many 
features, such as deep pipelines, branch prediction mechanisms an ... 
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28 Optimal communication algorithms for regular decompositions on the hypercube Q 
G. C. Fox, W. Furmanski 

January 1988 Proceedings of the third conference on Hypercube concurrent computers 
and applications: Architecture, software, computer systems, and general 
issues - Volume 1 

Additional Information: full citation , abstract , references , citings , index 



Full text available: ^ w ____ 

l£3 "^ terms 

We discuss optimal communication and decomposition algorithms for a class of regular 
problems on concurrent computers with a hypercube topology, using a general technique 
we call the method of cube geodesies. We address the calculation of various 
transformations ( convolutions, functional etc. ) of data distributed over the hypercube; 
examples are the Fast Fourier Transform, matrix algorithms, global scalar products and 
vector sums, sorting. These all involve long distance inter ... 

29 Fuzzy RuleNet: an artificial neural network model for fuzzy classification 
Nadine Tschichold-Gurman 

April 1994 Proceedings of the 1994 ACM symposium on Applied computing 

Full text available: ^| pdf(571.46 KB) Additional Information: full citation , references , index terms 



30 A QoS-Provisioninq neural fuzzy connection admission controller for multimedia high- I I 
s peed networks 

Ray-Guang Cheng, Chung-Ou Chang, Li-Fong Lin 

February 1999 IEEE/ACM Transactions on Networking (TON), volume 7 issue l 

Full text available: ^ pdf(342.90 KB) Additional Information: full citation , references , citing s, index terms 



31 Interprocessor communication speed and performance in distributed-memor y parallel I I 
processors 

M. Annaratone, C. Pommerell, R. Ruhl 

April 1989 ACM SIGARCH Computer Architecture News , Proceedings of the 16th 

annual international symposium on Computer architecture, volume 17 issue 3 

Additional Information: full citation , abstract , references , citings , index 



Full text available: - 

^ terms 

We have simulated several numerical and non-numerical algorithms on five distributed- 
memory parallel processors (DMPPs). All five DMPPs have the same topology (a torus), and 
the same number of nodes. The architectures differ only in the communication speed 
between neighboring nodes, while the computation unit is kept unchanged. The goal of the 
paper is to quantify the effect that interprocessor communication speed and synchronization 
overhead have on the performance of the DMPPs. Af ... 

32 Software protection: Attacks and risk analysis for hardware supported software copy Q 
protection systems 

Weidong Shi, Hsien-Hsin S. Lee, Chenghuai Lu, Tao Zhang 

October 2004 Pr ceedings f the 4th ACM w rksh p n Digital rights management 

Full text available: ^ pdf(1 67.21 KB) Additional Information: full citation , abstract , references , index terms 

<i>Recently, there is a growing interest in the research community to use tamper-resistant 
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processors for software copy protection. Many of these tamper-resistant systems rely on a 
specially tailored secure processor to prevent, 1) illegal software duplication, 2) 
unauthorized software modification, and 3)unauthorized software reverse engineering. The 
published techniques primarily focused on feasibility demonstration and design details 
rather than analyzing security risks and potential a ... 
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33 Hypermedia & TAL: Semantic Network Array Processor as a massively parallel I I 
computing platform for high performance and large-scale natural language processing 
Hiroaki Kitano, Dan Moldovan 

August 1992 Proceedings of the 14th conference on Computational linguistics - Volume 
2 

Full text available: ^ pdf(582.00 KB) Additional Information: full citation , abstract , references 

This paper demonstrates the utility of the Semantic Network Array Processor (SNAP) as a 
massively parallel platform for high performance and large-scale natural language 
processing systems. SNAP is an experimental massively parallel machine which is dedicated 
to, but not limited to, the natural language processing using semantic networks. In 
designing the SNAP, we have investigated various natural language processing systems and 
theories to determine the scope of the hardware support and a set o ... 
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Angela Sodan, Guang R. Gao, Olivier Maquelin, Jens-Uwe Schultz, Xin-Min Tian 

June 1997 ACM SIGPLAN Notices , Proceedings of the sixth ACM SIGPLAN symposium 

on Principles and practice of parallel programming, volume 32 issue 7 
Full text available* Ddfd 32 MB) Additional Information: full citation , abstract , references , citings , index 
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Distributed-memory machines have proved successful for many challenging numerical 
programs that can be split into largely independent computation-intensive subtasks 
requiring little data exchange (although the amount of exchanged data may be large). 
However, many irregular applications— e.g. in the AI field — have a fairly tight data 
coupling that often results from the use of shared data structures, making them in many 
cases not amenable to parallelization on distributed-memory machines. EAR ... 

35 On the computational power of neural nets I I 
Hava T. Siegelmann, Eduardo D. Sontag 

July 1992 Proceedings of the fifth annual workshop on Computational learning theory 

Full text available* 153 Ddfd 02 MB) Additional Information: full citation , abstract , references , citings , index 
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This paper deals with finite networks which consist of interconnections of synchronously 
evolving processors. Each processor updates its state by applying a "sigmoidal" scalar 
nonlinearity to a linear combination of the previous states of all units. We prove that one 
may simulate all Turing Machines by rational nets. In particular, one can do this in linear 
time, and there is a net made up of about 1,000 processors which computes a universal 
partial-recursive function. Products ... 

36 Learning Ensembles from Bites: A Scalable and Accurate Approach I I 
Nitesh V. Chawla, Lawrence O. Hall, Kevin W. Bowyer, W. Philip Kegelmeyer 

August 2004 The J urnal f Machine Learning Research, Volume 5 

Full text available: ^ pdf(3.34 MB) Additional Information: full citation , abstract , index terms 

Bagging and boosting are two popular ensemble methods that typically achieve better 
accuracy than a single classifier. These techniques have limitations on massive data sets, 
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because the size of the data set can be a bottleneck. Voting many classifiers built on small 
subsets of data ("pasting small votes") is a promising approach for learning from massive 
data sets, one that can utilize the power of boosting and bagging. We propose a framework 
for building hundreds or thousands of such classifie ... 
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May 1988 ACM SIGARCH Computer Architecture News , Proceedings of the 15th 

Annual International Symposium on Computer architecture, volume 16 issue 2 

Full te)davailable:mM(MOMBl Additional lnformation: Mcitation. abstract, references , citings, index 
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This paper presents the design and implementation of a high-performance special-purpose 
processor, called The White Dwarf, for accelerating finite element analysis algorithms. The 
White Dwarf CPU contains two Am29325 32-bit floating-point processors and one Am29332 
32-bit ALU, and employs a wide-instruction word architecture in which the application 
algorithm is directly implemented in microcode. The entire system is VME-bus compatible 
and interfaces with a SUN 31160 host. The syste ... 

38 Computing with structured connectionist networks Q 
Jerome A. Feldman, Mark A. Fanty, Nigel H. Goddard, Kenton J. Lynne 

February 1988 Communications of the ACM, volume 31 issue 2 

Full text availabler mfidflimMB) Additional Information: M citation » ^stract, references , citings, index 

terms , review 

The design and applications of massively parallel computational models could lead to 
dramatic advances in the ability to automate complex tasks such as those found in artificial 
intelligence. 
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41 Computational models: BLOB computing 

Frederic Gruau, Yves Lhuillier, Philippe Reitz, Olivier Temam 

April 2004 Proceedings of the first conference on computing frontiers on Computing 
frontiers 

Full text available: ^ pdf(1,02 MB) Additional Information: full citation , abstract , references , index terms 

Current processor and multiprocessor architectures are almost all based on the Von 
Neumann paradigm. Based on this paradigm, one can build a general-purpose computer 
using very few transistors, e.g., 2250 transistors in the first Intel 4004 microprocessor. In 
other terms, the notion that on-chip space is a scarce resource is at the root of this 
paradigm which trades on-chip space for program execution time. Today, technology 
considerably relaxed this space constraint. Still, few research works q ... 
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January 1988 Proceedings of the third conference on Hypercube concurrent computers 
and applications: Architecture, software, computer systems, and general 
issues - Volume 1 

Full text available* -fB df(4 66 00 KB) Additional Information: full citation , abstract , references , citings , index 
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We discuss communication algorithms relevant for neural network modeling on distributed 
memory concurrent computers with a hypercube topology. Full, intermediate (medium 
range) and sparse network connectivities are analyzed. We point out that the flexible 
hypercube topology allows for the efficient implementation of the broad class of network 
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Patricia A. Carlson 
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Proceedings of the 8th annual international conference on Systems 
documentation, volume 14 issue 4 

Full text available: ^ pdf(1.35 MB) Additional Information: full citation , abstract , references , index terms 

Computers are cognitive tools — they extend the capabilities of the human mind. Paper and 
pencil are also cognitive tools — they enhance human memory by acting as a permanent 
record, and they mediate the formation of thought by serving as a scratchpad or rehearsal 
device. However, there is a qualitative difference between these cognitive tools: the 
computer as a writing environment can become an active participant in the process while 
paper and pencil mu ... 

45 Exploiting global input/output access pattern classification 
Tara M. Madhyastha, Daniel A. Reed 

November 1997 Proceedings of the 1997 ACM/IEEE conference on Supercomputing 
(CDROM) 

Full text available: ^ pdf(232.01 KB) Additional Information: full citation , abstract , references , citings 

Parallel input/output systems attempt to alleviate the performance bottleneck that affects 
many input/output intensive applications. In such systems, an understanding of the 
application access pattern, especially how requests from multiple processors for different 
file regions are logically related, is important for optimizing file system performance. We 
propose a method for automatically classifying these global access patterns and using these 
global classifications to select and tune file syst ... 
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Jiun-Ming Hsu, Prithviraj Banerjee 

November 1990 Proceedings of the 1990 ACM/IEEE conference on Supercomputing 

Full text available: Qpdf(1.25 MB) Additional Information: full citation , abstract , references , citings 

This paper presents the architecture, methodology and performance evaluation of a 
message passing coprocessor (MPC) which can accelerate message communication in a 
distributed memory multicomputer. The MPC is a microprogrammable processor which off- 
loads the CPU of the burden of communication and speeds up the software processing by 
directly executing message passing instructions in microcode. It supports process 
scheduling, message buffer management, and fast buffer copying. The most uni ... 

47 An approach to fault tolerance and error recovery in a parallel graph reduction 

machine: MaRS — a case study 
Alessandro Contessa 

June 1988 ACM SIGARCH Computer Architecture News, volume 16 issue 3 
Full text available: ^ pdf(958.58 KB) Additional Information: full citation , abstract , index terms 

This article deals with the issue of fault tolerance and error recovery in a parallel graph 
reduction computer such as the "MaRS" machine presently under development at CERT. 
This is a multiprocessor system with decentralized control and asynchronous, delayed 
communications between cooperating, tightly coupled processes. A solution for the problem 
of MaRS error recovery is derived, based on the machine's execution model (successive 
reductions performed on the program graph, i.e. evaluations on t ... 
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annual international symposium on Computer architecture, volume 23 issue 2 
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Programming models based on messaging continue to be an important programming model 
for parallel machines. Messaging costs are strongly influenced by a machine's network 
interface architecture. We examine the impact of architectural support for messaging in two 
machines — the TMC CM-5 and the Cray T3D — by exploring the design and performance 
of several messaging implementations. The additional features in the T3D support remote 
operations: memory access, fetch-and-increment, atomic swaps, a ... 

50 Graph contraction for physical optimization methods: a quality-cost tradeoff for I I 
mapping data on parallel computers 

N. Mansour, R. Ponnusamy, A. Choudhary, G. C. Fox 

August 1993 Proceedings of the 7th international conference on Supercomputing 
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Mapping data to parallel computers aims at minimizing the execution time of the associated 
application. However, it can take an unacceptable amount of time in comparison with the 
execution time of the application if the size of the problem is large. In this paper, first we 
motivate the case for graph contraction as a means for reducing the problem size. We 
restrict our discussion to applications where the problem domain can be described using a 
graph (e.g., computational fluid dynamics appl ... 
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R. P. Pargas, D. E. Wooster 

January 1989 Proceedings of the third conference on Hypercube concurrent computers 
and applications - Volume 2 
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The parallel implementation of branch-and-bound algorithms on multiprocessors has 
received an increasing amount of attention in recent years;. This paper describes research 
currently being conducted at Clemson University. We study the job scheduling problem: 
given n jobs, each with processing time, pi, and due date, di, schedule these jobs on a 
single machine such that total tardiness is minimized. Total tardiness, T, is defined as: T = 
nli=l w... 
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